1 „ rtT> The invention also provides a method of operating a 

APPARATUS AND^ETHOD FOR ^^^B.ta^^JfeJjhJ 

ACCESSING A BRANCH TARGET BUFFER £ of MCh ls t0 hold only 

, fn sdertcdbits of a fetch address of each branch instrucUon 

^epresentinventionrelatestocomputersyst^ando sel dtheRAM f each partition is arranged to hold only 

branch Lget buffers for use in the supply of instructions to ^ rf ^ ^ ^ of branch instrnctio^m 

a processor of a computer system. ft buffer , whereifl S aid selected bits of both the fetch 

To run a computer program a "ggJ^J »o respective address, 

caries out a sequence of functions, typicalty fetching an ^ enabks a significant reduction in the storage capacity 

I^^^^^^SS^SS and overaU size necessary for the CAM *ray and data 

accessing an operand for use oy xne w» ti RAMs. a . 

the instruction, storing the ^ «^ "Jl w iU Preferably a read operation of said branch target buffer is 

determining the next instruction address. A problem w mi b t si ^ relaU ng to fetch 

occur where an instmction J^J^JSS aaSSa, a first input comprises some of said selected 

determinesthe address of thenext ^^^ 0 ^^T K^ 0 f a fetch address to test for a ache hit in any cache 

AnmstructionoftMstypeislmownasacondi^ biUofa te«n ^ ^ . mi a ^ input . 

The consequence of the presence of a ™<f™f ™ to the remainder of said selected bits of the 

instruction is that the instruction typically has Jo .pass w gJJ*^ said input controlling masfang or- 

Sgh several pipeline stages before die test is resoWed a ^ Mt output &0 m any partition not , 

and before the next instruction to be fetehed can be deter cutty p todatainsaid second input, 

mined with certainty. This can delay the ^pehne process^ C °^ v J on ^ to a branch target buffer comprising 

A program sequence may also inc ude ^°^ on J ™ cache memory, each partition comprising a 

jump instructions. Such instructions, if executed ^ ult m M JJJJ^J* selected bits of the fetcl laddrea soff 

program sequence jumping to a new instruct ion_ CAMarra y J $ ^ ft ffa , ^ 

Forlhepurposes of the present description and arSy m each^artiuon_coi^^ 

c^difi^naUnrnon^oidi&ju^ . . Actions held in the buffer, said selected bits of fee fetch 

m this specification the term instruction includes pnmi- 30 d dected bits of said target address each com- ■■ 

tiv^o^ aS which may.be included*! a VLIW system -JJJ bits of fte resp ecuve address 

SUMMARY OF THE INVENTION 35 ^^^^J^ 

The invention provides a method of ^ ^ said most significant bits being found by said companson 
instruction into a branch target buffer which ~?^« a . circuitty . 

BRIEF DESCRIPTION OF THE DRAWINGS 

ll^^JS^S— ^G^showsa^schematicdiagramofcircuitryfor 
^rSKfflStcaaddressofthe fc^meUig of this invention; 

S iSuof^ «g*« first selected bits of 45 3 shows a partial block diagram of an embodiment 

^Se?o?saidbrandiEc«ion .M-- of a branch target buffer for implementing the invention^ 
*amtchbetweensaidbits.writmgsecondselec^ an illustrative circuit diagram of read 

S Sh address of said instruction to a CAM " cicuitry of a partition of the branch target buffer of FIG. 3, 

address data indicative of _tte target a seC0 nd Dr f^tion values for an instruction; 

A write operation into said branch target buffer is not computeT system, and 
c^mpSSmparing said first selectedbits of the fetch 60 ^ Q , shQWS a ^ter system including the branch 
address and the target address no match is founcL buffer of nG . 3. 

: zssr~ 

address. 
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mstruction js represented by an instruction number (0-19) the first instruction 12 of fourth instruction word 104 (if the 

and by an address indicative : of a storage location at which conditional jump at instruction 9 is not effected) or executing 

toe instruction is stored and which is represented by the *e second instruction 13 of fourth ins^ction worf 104ftf 

binary equivalent of the instruction number. The instructions ^nH;n rt n.i • ^ ^ * . " ? worQ AW l" 

aremusfrativelygroupedintosequencesoffour Z£ - ^onal jurnp mstructzon 9 is effected). Finally; the 

101.102.103/l£Yo5^^^ 5 S^2ffiSr^^^ fc ^ 

instruction word. Each instruction word has a single word ^ f ±* XCUB rf ^ to instri,ctioQ 

address, that of instruction word 101 being "000" that of word 104 has been executed or if the 

instruction word 102 being "001", and so on. Each instruc- c f t ndlti f onal J um P Ruction 14 is not effected, or the 

tion has a byte address indicating its position within the „ w ™ " " M * * Mkm 2 ° f 

word, so that the first instruction of each word has bvte W mstIucll0n ™ d m - 

address "(XT. the second "OF and so on. The majority of the For ^^ni speed of operation, the computer, having 

instructions, specifically instructions 0-4, 6. 8, 10. 11. 13 * etcned a given instruction word for execution, should next 

and 15-19 lead directly onto the next sequentiaiinstiiction: !" £Ich * e coirect ^^^n word, in the sense of the 

these instructions are marked *X\ Instruction 12 (marked , 5 instruction word containing the next instruction which is 

*j*) is termed an unconditional jump, because if execution 12 re ^ u ™ 1 10 06 «ecutedL However, it will be seen from the 

reaches instruction 12 the result is to jump to instruction 16. fore S°iag that the identity of the next instruction to be 

Instructions 5. 7. 9 and 14 (marked 'cj') are each termed €Xecuted may vary if the current instruction word contains 

conditional jumps. The next instruction to be executed after any branch Mstruction. Ta & for example fourth instruction 

a conditional jump typically depends upon a condition V( word 104: 

tested, and may for instance be determined by whether the ' Instruction word 104 may be fetched either in response to 

value of the operand, stored as a result of the instruction executing instruction 9 or instruction 11 of third instruction 

which immediately precedes the conditional jump in time. word 103 or t0 executing instruction 5 the second instruction 

exceeds a given value. The instruction to which the jump word 102 - T& e next correct instruction word to fetch after 

will occur if the condition is met is indicated in the figure. worci ^ k cM«r first instruction word 101 or firm instruc- 

Thus instruction 5 will either be succeeded in time by ' uon wor<i l** 5 * By examining fourth instruction word 104. 

instruction 6. or by instruction 14 depending on the value ^ reader w2i aote that if instruction 11 of third instruction 

returned b y instruction 4. Instruction 7 will be succeeded hy word 103 were executed, then the first instruction 12 of 

instruction 8 or by instruction 0 depending upon the value fourth" mstruction wordllw would be executed next, andthe 

returned by instruction 6. Instruction 9 will be succeeded by - 0 lt ^ x of mat . would be that fifth instruction word 105 is the 

instruction 13 or by instruction 10 depending upon the value - _> n2Xt word t0 f ctcn * % however fourth instruction word 104 

returned by instruction 8. Instruction 14 will be succeeded werc fetched as a result of the conditional jump instruction 

either by instruction 15, or by instruction 2 depending upon * or ^ conditional jump instruction 5. the identity of the 

the value returned by instruction 13. " next correct instruction word to fetch depends upon the 

In some computers, the instructions will be fetched fiom 35 outc °roe of the conditional jump in instruction 14. 

store individually, with instruction 0 fetched first, and then * Continuing to consider the fourth instruction word 104. 

instruction 1 fetched, and so on. However, in certain fetching a wrong word (for example, first instruction word 

computers, it has been proposed to associate plural instruc- 101 being fetched instead of fifth instruction word 105) 

tions together in sequence called instruction words, so mar would result in the processing p^eline containing a number 

fetching from an addressable location makes available the ± 3 "of wrong instructions which would of course not be 

plural instructions of the word for execution. In the present ' executed. The practical consequence of this would be that a 

description, the term instruction word is not intended to be number of cycles would be lost during which no useful 

limitative and is used for convenience to refer to a plurality execution took place; only on a subsequent correct cycle 

of instructions, or one VLIW instruction, which may be would the correct fifth instruction word 105 be executed, 

fetched by accessing a sifigte address in store. ^ 5 after being ealled up into the pipeline. 

In the example of instruction words shown in FIG. L the A simplified device illustrating some of the features of the 

first instruction word 101 contains no branch instructions, ie. present invention will now be described with respect to FIG. 

no conditional jumps and no unconditional jumps. The 2. " 

^i^T^ T d l c 2 ^° J ? nditional Veiling to FIG. 2. the device comprises memory cir- 

SEES ;^f C ^? 5 ^ dr ^ c ^mstoctionword « cuitry 200 which consists of a memory ^ray 205 having a 

™<" n ^^^ plurality of addressable locations and an address decoded 

The fourth mstruction word 104 contains one unconditional 210 for addressing the memory array 205. Hie address 

S£n^ U ?£ ^Tn 2 «£™ conditional jump decoder 210 has an input 201 over which it receives instruc- 

ms^cuon. instruction 14. The fifth instruction word cor, tion addresses within a single instruction word. T*e address 

tarns no jump or condmonal jump instructions. 53 produces address outputs co^^g t0 &c k $ t 

It will be appreciated that the program flow due to instruction address and all later instruction addresses up to 

executing the first mstruction word 101 will always be the the end of the relevant instruction word. The number n of 

same, namely executing the first instruction 4 of second storage locations in the memory array 205 is at least equal 

mstruction word 102. Executing the second instruction word to the number of instructions in the program sequence 

10- causes three possible program flows, namely executing to currently being run. The memory locations of the array 205 

me first instruction 8 of third instruction word 103 (if no store at the address of respective instructions, information 

conditional jump is effected), executing the second instruc- showing whether the instruction is a branch instruction or 

tion 13 of the fourth instruction word 104 (if the conditional not. Le, a logical 0 where the mstruction is not a branch 

jump at instruction 5 is effected) or executing the first instruction, and a logical 1 if the instruction is a branch 

instruction 0 of first instruction word 101 (if the conditional 65 instruction; Each memory location has a respective output 

jump at mstruction 7 is effected). Sumlarly. me outcome of line 220,-220.. The output hnes 220.-220 arecoupled to 

executing the third instruction word 103 is either to execute a register arrangement 250. having storage locations corre- 
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5 f Tn the nresent example, the first instruction word 101 

store logical 1 wnS an associated sec A Qnd Action word 102. For the ^^^J 

predicted as taken, .and a logic* 10 w an £ m ^ memory drcuitry stores a logical al J 

instruction address is input locati ons of the bvte positions in each word being designated 1 to 4) oi m« 

decoder 210 addresses the f""*°» J ^ ffespondin g ^^^^.logMl-^iddresscoiwpondiBg 

output on the correspond^ on of tt P Log^ P cotrespoading to the second posmon 

220,-220,, which m ^f^f e */f corresponding SfiSi output "fine 220,-220,, corresponds to fte 

tog logical 0 if Jieto jg^SlSSi fc not a branch ^ovides logical one outputs ^«*"*££g 

will always be logical 0. The ouq>ux _un i instruction 7 is effected. . 

register 250 form word MestuW e 30 ft w jump d ^ ^^rmtry 

target addresses of branch '^^^^L^. - ^Sm^^^^^^f^ 
JtawdteitJiiJpit*^^ SJand the decision circuitry outputs the target address 

r^^^^^^J^^JL so 5?SfJ?««hi predicted-taken branch faro** 

correspond to b ran * with prediction m S * ™ ^Tfor example described with respect to FIG. 

tne circuitry of HG. 2 is not sho^n ^Mai^^ 

Once the irtitialisation £^*^VteK ffint'tn uo^nditional jump, and is always therefore 
sequence ^^^"ctionwordlOlisfed 55 J**" as ^ would not be presented on one of the 

logical zero at the second output 30- 



i 

V 



( - 



7 8 

word, and in the example described above with reference to Referring to FIG. 4. partition 401 consists of n content 

FIG. 1. this number is 4. It will of course be understood that addressable memory cells 510^510^ coupled to the instruc- 

other numbers could be used according to the content of the tion word address input bus 500, havino a first plurality of 

instruction word; specifically in a more complex system 8 CAM output lines 511^511^. As wilf be later described 

partitions could, be used. It will also be appreciated by one 5 herein, the present embodiment of the branch tar*et buffer is 
skilled in the art that fewer partitions could be provided. For . operated dynamically, in the sense that once all of the lines 

example, it would be possible for the four-byte example of except one contain data the partition is reoarded as *tu1T 

FIG. 1 to only provide two partitions, one corresponding to The one unfilled line is retained for writing a new entry and 

the first two bytes of each instruction word, and the other as part of the write operation, details of one branch instruc- 

corresponding to the second two bytes of each instruction J0 tion stored in the branch target buffer are discarded so aVto 

7 4 , e . . , A „ , be ready for input of a next newly-found branch instruction 

In the present example first partition 401 stores data A braQCQ mstruction * said t0 £ << Dewly . foun(r J " ^ 

byte or a word stored in the buffer, second partition 402 ft , a , OIIrto *i. v , . , ~ T l~* * 1U1 

stores datarelevant to the branch instructions at second byte IT'l ' 1 r & * ^ * ^ be n ° ted a 

locitioiis^soon.Hmprttion^lis^riiS 15 ^ was cUscarded from the branch 

as the lowest partition, and fourth partition 404 as the ^\ b f*™ 0ne ? cle °! °P cratI0n ™9 ***** * 
highest partition. newly-found instruction during a later cycle of operation. 

Each partition of the buffer 400 includes a CAM array 1 ?" S ' Whereas Rasing the number n of content address- 
holding a plurality of word addresses and associated data * ble . memo ? ceUs mcrcas « complexity and size of the 
RAMs holding target addresses for branch instructions » devic ?* such mcrcase tcnd * to reduce the number of 
located at the word addresses stored in the CAMs. The buffer ? ccasioas °* which a branch instruction needs to be written 
has first and second input buses 500.501 giving the address iD ' 

of an instruction being fetched. The first input bus 500 is an Decreasing the number n of memory cells provides a 

instruction word address bus receiving the most significant smaller and simpler device, but with the penalty that the 

bits of the instruction address and the second input bus 501 25 chances of failing to find a branch instruction are increased, 

is an instruction byte address bus for the least significant bits Where a jump instruction is not found this tends to lead to 

of the address of the instruction being fetched. In the a processing delay. 

- am ple cf FIG. L t h e fir st t aggJaiti^iheun^ I&^he^reseritly^ 

address form fee instmction word address and the lowest ab i e memory cells 510,-510,, are capable of storin* the 

two bits form the instruction byte address. An input on bus 30 word addrcs ses input over mekstruction word S bus 

*e\^ ~ ^ * t*- *? f *■ certain jSSS 

the instruction being fetched. Tht input 501 is med to ™ d stwed » * e content addressable 

controimeoutr^tofmeCAMairaysduringareadopCTation ^ U ? 0I1 . word address oftht ' mstniQ - 

so as to form a masking operation which prevents output <wirently being fetched is input over the instruction 

from any partition CAM corresponding to a byte location 35 WOrd address bus 500 when Mstruction word address 

within the word before the byte position input on line 501 corresponding to one of the stored addresses is input a 

(that is when the input on 501 is greater than the partition ^B"* 1 } occurs on &e corresponding one of the CAM 

number). If a read operation in the buffer 400 has a hit for 0Ut P ul 511r-511 n . 

the relevant instruction identified by the inputs 500 and 501. Selection circuitry 512 has an input connected to the 

the buffer provides a target address output on bus 507. an 40 instruction byte address bus 501 and is connected to the 

output 505, being a bus carrying information indicating CAM output lines 511 1 -511 n so as to selectively disable all 

which partitions contain branch instructions from the current of the lines of a partition. The selection circuitry 512 has a 

instruction word which are predicted taken, and an output processing circuit 560 receiving one input from the instruc- 

5 04 carrying information on the "prediction strength" tion byte address bus 501 and having an output 561 con- 

obtajfied from each partition. 45 nected to the control inputs of a plurality of pull down 

It will be understood that when a processor outputs an transistors 562^562,,. Each of the pull down transistors is 

address to memory to fetch a new instruction, that address connected between a respective one of the CAM output lines 

is simultaneously fed to the buffer 400 to carry out a read 511,-511, and earth. During reading, a control input 912 

operation and rf the processor executes a branch instruction stts the processing circuitry 560 to produce a Heb wtput so 

S^rr" 11^-^ fifV 0 ** ^ * * to Jn on all of the puU down LsisTon S3? £ 

instruction will be written into the buffer after execution by < . . ,/ . L . J ,~» m 

the processor so that it is ready for use at some later tin* T ? ^i ^^' ^ T* * C mstIuction ^ 

To write data into the buffer a word address mputbus^O ad *.f b ? S which k 15 ^ ^ *e number of the 

is provided to write word addresses into the CAM arrays, a f^ 00 * £ th,s ™* *? cacm * act to 

target address input 908 is provided to write into the data P^ 01 " where storcd braQC h instruction has a 

RAMs target addresses for the branch instructions which are 55 byte locatioQ the bvte position indicated on line 501. 

newly written into the buffer. As will be later described, any ™ selection circuitry 512 has output lines 513^513,, 

write operation into the buffer is carried out simultaneously which {oxm ^puts to pffediction processing circuitry 514. 

with a read operation using the same word address and input prediction processing circuitry 514 includes an 

900 as a read input 500 together with the appropriate byte n-location store 563 for storing prediction information for 

address 501 to avoid writing into the buffer any instruction 60 indicating whether an associated branch instruction is pre- 

which is already held in the buffer. Other inputs and outputs dieted taken or not taken. Each of the store locations 

are shown in FIG. 3 and their function will be described with 563^563,, corresponds to a respective CAM cell, and has an 

reference to the more detailed drawings of FIGS. 4. 8 and 9. output connected to one input of a respective two input AND 

As will be seen with reference to FIG. 3. each of the gate 564 1 -564 n . the other input of which is provided by a 

partitions 401-404 is substantially similar and therefore a 65 respective output line 513^513,, of the selection circuitry 

tetaUeddesaipuonofoneexeirml^ 512. As shown in HG. 8. an OR gate .565 is connected 

1x5 S* ve£L between the respective store locations 563 and the respective 
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si* 616 716 816 of the associated partition. In any one 
AND gate 564,-564,.. but as this » «"«JfS p"l St£ ononis such that when the respective NOR 
d^^^^J^^^^S^ gS516.616.716.816receivesalogicdlatanyoneo^ 
later be described herein, the prediction store 5 "stores a S conespondi ng output line 517. 617 717. 817 

logical 1 in positions corresponding to branch P the respe ctive first trans- 

S are predicted as taken, and a logical 0 in posiuon 5 g« ^ m g31 „ d ^ respective second 

ln-« n ondSe to branch instructions which are predicted as ™ Emission gates 530. 630. 730. 830 non-conductive. 
rSSeW7^escribedmorefullywithreferenceto ^is h^Xeffectlf interrupting the target address output 
8 allows the writing of new prediction information to me relevant part iUon and the next higher 

SfprSn store. Each AND gate 564^564,, has a that only the 

re peSve output which is connected to a respecuve output 10 * . & a j for ^ input of ^ NOR gate =,16 616 

Ss -515 of the prediction processing circuitry sl4. As 6 gl6 can ly a ^ add ress onto toe target address 
SowTmHcS ^ output bus 503. The output RAMs 518-818 are therefore 

StIchANDgate564 l -564 n andtheoutputline51 Sl - 3 l S „. c0 ^ e rt e d serially to a common path to the output503 and 
b„ al ;to£>orms a dkect connection during reafing * is ^trol gates 530-830 in that path allocate a decreasing 
omitted from FIG. 4 for clarity. The output lines 515^51^ 15 tQ ^ Qas progre ssively moving away from the 

formtoe word lines to a RAM 518 and also form inputs to * ^ onc partition can only provide an outout if 
an™i NOR gate 516. The NOR gate 516 is connected ^ ^ Qrity partition is providing an outout Once one 
m a sSle output line 517 which controls afirst transmission Qn ^ ^ output all lower priority partitions are 

oate 531 and a second set of transmission gates 530 Tne ^ prov ide an output 

NOR gate output lines 517-817 from all of the ypartmons 20 n rf ^ circuitry of rq. 3 will now be 

tooetoer form me output bus 505. As shown in HO. 8 toe , 
output of NOR gate 516 is connected to OR gate 590 which ^ branch ^tractions are stored in a parti- 

is omitted for clarity in FIG. 4 as it has no effect during a corresponds to toe byte position of toe branch 

read operation as toe signal on line 920 is 0 dunng a read. insttuction in its instruction word For example, datarelating 
During a write, the gate 590 in each partition receives a 25 ^ in ^ ^ byte position of any word 

signal 1 on line 920. is stor ed in first partition 401, data relating to a branch 

Data RAM 518 consists of n rows of storage cells, each ^fo^n in the Sficand hytepoiitiQianJfaMfiCM^ 
- r^cJw^radd^ ""SEnd so on. Hence for the example of FIG 1. 

STSs Each row of storage cells stores toe target 402isrel e V anttomstrucUons0.4.8 12andl6(toefirstb>te 
address of a branch instruction i.e. the address^ which » & of Mch Action word), toe second pardon ^402 
eSon will continue if the branch is taken. Tms infer- md es information relevant to instructions 1 

Son is male available on target address output bus 503. 13 ^ so on . Data indicating the presence of branch 

wS pas^ between the partitions, but which contains^ „ rtored m toe CAM ceUs 

ttansmission gates 530 for isolating partitions stomg data ^ tQ ^ redicti on as to whether the branch ^ ^taken 

reS to later bytes in toe same word as will be later 35 ^ ^ processing circuity- and data 

Sed Bu?503 is connected to sense amplifier orantry relatin t0 &c target address of the branch msttucbon ^ toe 
<£^«vta. a target address output 507 and a target address j » f ^ partit i 0 n. In the presently described example. 

t D uf90?LerdEgTwrite operation. The bus 503 forms ^ a whi £ indicates toe presence of branch instruction 
^^^^^P^ba^^^^^ „ is toe full instruction jvsid. address of the word containing 
wSreJecS^ the branch instruction. As will be later described herem ^ 

SfSh transmission gates in the bit line paths between ^ f0 ^ only a ^ 0 f the mstrucbon word address^ 

^rnTdataRAMs The data chosen for indicating the preseno : off branch 

Tne brtl^ge't buffer also consists of prediction- Actions is stored in the content addressable memory 

as first inputs, toe output hnes oi ™* " . * n ° wu ™ . „ sl0 of mc fost paxtiuon 401 

Srcuitry 512 and produces an output on predicUon.strengto 5 . it wfll ^^^"^^struct^^^ 104 

new prediction s T gto data. Tnis latter is described with ^^^^^^^u^^ 

reference to FIG. ». . v ' f A ; n «imiction words, namely the posiUons 

Referring again to FIG. 3. and as ^ *J. Xj££EKE.» and^.Tnus'toe ceUs 610 of 

remaining partitions have similar structures to that of toe 55 ~rr«ponam» to addresses "001" and "010" 

first partoioV401. The integers of each potion «toh» «2SS2w«. Entries are similarly made in the third 

similar to those of partition 401 have .^L r £""^ «™ (for address "011" corresponding to instruction 14) 

numerals, toe reference numerals of partition 402 being in partiUon^ (f J^J^ (corresponding to instruction 7). 

fte range 60<Mi99. those of P*mtion403 beuigin toe mnge and m toe fourto P ^atext of the instructions 

700-799 and those of partition 404 being m the range «, SuU referrm^toMU^ ^ ^ fe ^ WQfd 

800-899. . address bus 5oo of toe word address of the instruction 

As previously described, between each ^ D SSSy being fetched; this address is apptied by toe bus to 

tion id toe adjacent partition there is a first Jan^ 0D 2 sl0 . 610 . 710. 810 of the whole device 

gate 531.. 631. 731. 831 and a second set of transmission . , > - — 
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relevant output line of the CAM. If for example the word It will be noted that in the remaining i c first-third 

address 000 , corresponding to word 101 is input, no partitions 401-403. there will be no logical 1' s at the output 

matches occur in any partition and no CAM hits will occur. of the corresponding prediction processino circuitry 514 

However, when the word address "001* of the second 614, 714. As a result, NOR gates 516. 616/716 will have a 

instruction word 102 is input over bus 500. a match occurs 5 logical. 1 output which ensures that the corresponding trans- 

in the CAM cells 610 of the second partition (corresponding mission gates 530.630 etc; 531.631 etc are conductive and 

to instruction 5— see FIG. 1) and in the CAM cells 810 of that the corresponding lines of the predicted-taken bus 505 

the fourth partition (for instruction 7— see HG,..l). The will be at logical 1. save that line which is derived from the 

result of these matches is to produce CAM hits on the CAM fourth partition 404. This allows the output from partition 

output lines 611, 811 associated with the CAM cells where 10 404 to be output on lines 503 and 504. 

the hit occurred. For a second example, assume that fourth instruction 

An input is made to the byte input address bus 501 word 104 in the fetched sequence is the first instruction 

comprising the instruction byte address, which represents (instruction 12) to be executed. 

the position within the instruction word at which execution N ° w * reference to FIG. 1 shows that instruction 12 

is to commence. For the example shown in FIG. 5, assume *5 (having address 01100) is a non-conditional jump, 

that execution of the program segment shown is to start at Accordingly, every time execution proceeds to this 

the third instruction of second instruction word 102 instruction, the jump is taken and provided instruction 12 is 

(instruction 6 in FIG. 1). In this situation, the word address stored in the branch target buffer, the branch target buffer 

input over bus 500 would be "001" corresponding to the circuitry will provide a logical 1 on one output line 

second instruction word 102, This would produce CAM hits 20 515^-515,! of prediction processing store circuitry 514 in the 

in the second and fourth partitions. Referring again to FIG. ^ rst partition 401. It will be appreciated that the fact of 

1. it will be seen that the byte address for the third instruction taking branch instruction 12 means that execution will not "" 

of each instruction word is u 10 w . It is this address which is Meetly proceed to instructions 13. 14 or 15. As a result, the 

input on instruction byte address bus 501. . predicted outcome of branch instruction 14 is irrelevant 

The selection circuitry 512. 612. 712. 812 includes 25 bc . cai | sc that instruction will not be executed in sequence 

respective processing circuitry 560 (see FIG. 4) which acts with ^t™*** 011 12 - T*ie branch target buffer takes this into 

during reading to disable any partitions which correspond to account because the NOR gate 516 of the first partition 

"bytes-lower than the byte address input on instruction 'Byte ^^^^^a*^®^^ 

address input bus 501. Thus, for an input byte address of non-conductive the transmission gates 530. 531. thus iso- 

"00" no partitions will be disabled, and all selection circuits 30 lating ^ sec ond-fourth partitions 402. 403. 404. The.sec- 

will pass any hits from iaput 511.611 etc to output 513.613 0Qd set of * ansmissio * g^es 530 prevents the output of 

etc. If the instruction byte address is "01" then selection t2xg& } **** from * e ^ of fee partitions and, 

circuitry 512 of the first'partiuon 401 will not pass any hit as wU1 later be described herein, the first transmission gate, 

from an input line 511 ^511^ to an output line 513^513^ 531 Pt^^s the prediction strength of predicted unexecuted 

whereas in other partitions, any hit will be passed from input 35 br f^ h instructions from being updated. Although this pri- 

to output. If the instruction byte address were "10" then any ontisin g feature has been described for the unconditional 

hit would only be passed bv selection circuits 712 and 812 branch instruction 12. it should be noted that the branch 

in the third and fourth partitions, and so on. In the present tzrge } buffer h EOt aware ***** 12 h different to any other 

case, the instruction byte address is "10". thus enabling only predicted-taken branch. The buffer will therefore treat any 

selection circuitr)' 712 and 812 to pass any hit. However, the 40 other P^cted-taken branch in the same way. i.e. act to 

only hit which occurs is in the fourth partition and this is cxclud c any output for later branches in the same word, 

therefore allowed to proceed as an input to the prediction . Consider a farther example in which execution of instruc- 

processing circuitry 814 of the fourth partition ^ word 102 t0 comme nce from the first instruction of 

It wfll be recalled- that the prediction processing circuitry ™* 4 of FIG. 1. having address "00100"). 

514.614.714 and 814 each consists of respective prediction 45 P«V« <* this example that instruction 5 

store 563 which contains a logical 1 or lo^cai 0 indicating °m ?n * n v ^ instructio * 7 

whether a respective branch ^predicted as being taken or <°°\ n) 15 pTt<hCt * ^ ™ e word address is 

not,andpluralA^gates564.TheANDgatesrLiveone wm nr^W^M r AM /^J*** 

input from the associated selector circuitry and one from a *f°^ e fl l°^ ^ * the second and fouith 

respectiveentry of me register.^us. consMermg instruction 50 „ ^ 't^LT*" ° VCf 

No7ofHG.l.thepre^ SSS^^^SJ^ 2? * 

fourth partition wxU either produce a logical 1 from a ^^wf ^7 of tlie paruuons As mentioned 

respective AND gate on one of the output Hnes S15 (if the f above ^ e f f ad . dress causes Section circuitry 

branch instruction 7 is predicted as taken) or will r*oduce all ° fr0m u mpUt J? ? Utput * t0 P™ 1 ^ c °nesr*nding 

logical 0's on the output lines 815 (if the branch instruction 55 • t0 ^ precaction processing circuitry. In the 

7 is predicted as not token). presently described example, such logical 1's occur in the 

f. . - . second and fourth partitions 402 and 404 In the vrnnH 

T^^l tt JF*5? ** P fe ? cnc t° f a 1 ing circuitry stores a "not-taLn" prediction, and as S 

rS^ST" stored " 0001601 "ZVT* 0 Jas 00 braflCh ^ ^ " y 

f caus 7 * contrast, the predicuon processing circuitry 814 stores a 

1. The output of NOR gate 816 to go to logical 0 and; "taken" prediction in the position corresponding to word 102 

2. The corresponding row of the data RAM 818 to be and the application of a logical 1 Tut" provides a jump taken 
addressed, thus outputting onto the target address out- 65 output on one of the lines 815. This: 

put bus 503. The target address "00000*. i.e. the target a) Provides a logical 0 at the output of the associated NOR 

address of the branch instruction. gate 816; and 
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b) causes U.c^U RAM S18 of the fo^ 
} output a target address of "00000". As none of the gj™*^ haye , ktfcd , whic h causes the 

"lower" partitions has a »on-co D ducUve tran^on S7£ 572 • wd, ■* ^ th uU( g^ transis tors 573,-573 to turn 
gate. this target address is passed through to the target jsoaaie ^ ^ ^ ^ rf ^ 

address output bus 503. 5 ^rtfon st«e 571 to be pulled to logical 0 thus causing die 

As a result it can be seen that the J^gg^E prediction value stored in that stage to change to logical 0 
described above identifies only the first P«^d^ten pr tdictioa or remain at logical 0 if it was 

branch instruction of a sequence, which is not excluded tor in * ^£ 

Mention by being prior to the "J ^ffusly noted, there is a reactive ^nsmission 

sequence to be executed, and. more specifically, the target w ^ ^ ^ cQnnected ^ update ena b lf Une 502 
address of that instruction. . ^tween each partition and the next higher potion. It wui 

As previously discussed with respect to FIG. _ 4. each £™ ^ M transmission gate is conducive unless 

paction of the branch target buffer includes , "gg»J ^rSant partition has identified that a branch mstrucUon 
known herein as prediction strength processing c£C^ *°5 jc oredicted taken. In that event the transmission gate is 
Eg formation based on the history <* JgJfSStacdw during the operating cycle by fce 

instructions identified in the corresponding paruaon As renaer velineS17 . 617.717. 817 going to logical 0 kM the 
previously noted, FIG. 4 represents an exemj Any. pardon re^ecuve un gates are conducive. 

SoiandmepredictionstrengApr^ Sa^cal Z «oisappUedviameune502toaUpamnons. 

counterparts 650. 750. 850 in the other parUUons. a fa precharge level. The logical zero is then disconnected 

Prediction strength processing ^^ 5 f^ Ce ^ 20 foe line remains at that leveL Once transmission gates 

inputs from each of the selection crcuitry oufcut hn* 20 but tn non . CO nductive in partitions where a 

513,-513.. A logical 1 will occur on one of those output 531. Wi 8 one fa tQ the ^ 

Le 513-513. when a match ««s b^a " ^TneSase^ 

address input on bus 500 and data wdrcating theg^ence of 50^1 q ^ ^ n ^ a order up 

a branch instruction stored in one of .^ C*M «>dh M £3 includi ng 'any partition in which a branch insertion 
510,-510,,. provided the selector arcuitry 512 , i*s not 25 j 0 ^ ^ as ^ a ^ CTpartition s, regardless of whether 
disabledthepartitionbecause r^t^yZ or not th ey contain predicted-taken branch *«mc^fdo 
is prior to a first executed instrurtionof Jjej"™* ... ^recervU^^^ 

■B^HannhTllB^^ ^2^^^^;^^ updated . 

respective AND gate S70,-570„. The °J« to each preoi ^ ^ ^ orf 

AND gate is derived from a respective entry 57^571 °t ^^gvingpiurdbranctfinstnictionsatisd^irabkth^ - 
a store571 in the prediction strength processing gantry mseda ^ ^ g ^ place for aU 

which stores infonnation indicaUve of whether tiiere is me ^mictions which are not excluded from 

associated with the corresponding instruction a so-called ^ onl of ^ initial execution point ^within 

^ e ak" F edictionoraso^aUed"strong^predicUon In toe JJ^"* ^ first predicted-taken 

p Tesente^c^ent.aweakpredictionurepresen^bya 35 A leifall oftiie partitions 401-104 stored 
logical 1 stored in die corresponding stage, and a strong branj Asan wmp ^ ^ ^ and rf 

prediction is a logical 0 stored in the corresponding ^stage. ajng as ^ ^ ^ ^ . 
F A strong prediction indicates that a high degree of con exec a ot-taken) and the instruction in 

fidence that the presently stored prediction is correct and g JoSw^pedioted taken, then no update shoul be 
Aerefore unlikely to be changed whereas a weaipre^on 40 part*™ * P $ stored ^ partition 401 

Lcates a lower degree of 0^ this instruction could not be executed) and no 

the present prediction, and a greater ^ oodo ^ esM be performed upon the instrucuon represented 

The lines 513 r 513 n also form one input to bVpartition 404 (because ihe instruction represented by 
AND gates 572. 1 -572^..The other fry? 0 * 5 ^nableMine- ^02 45 partSn 403 is predicted as taken, thus 

572, -572. are provided by the update enable tine £*2. P ^ on 4e ^.rueoon represented by p^Uon 4M) 
whkhis common toall those g ates ^^ e ^° f n e t*£S Tnis update strategy is provided for in the branch target 
gate 572,-572. controls a respective pufl Uown tr^or "^e foUowing way: .. ' 

573. -573 . The pufl down transistors 573,-573 are con □ example the byte address input over byte 
^^^c^U^c^m^^P^ J^gSlZSlc*** L selection circuitry 512 to 
strength store 571. The oulput of the AND gates 570^,-570 50 aaore up wm fee nQ , 0gical B on 

oTinputs to an n-input OR ^gate 574 ^g ou^ut 504 fg^ n0 log5cal ones * be^oUedto 

which iidudes an output prediction strength line for each any _ ^ 55(> ^ t of ^ 

partition. „f,^in D ut NOR gate 516 remains at logical 1. Gates 530.531 remain 

P Inoperation.whenalogicallappearsononeof&einput ^^u^jve. Although there will be a CAM hit in the second 

lines S P 13,-513„ (indicating that a branch msttucUon ^has 55 conductive M ^ ^ 

been "hit" 'in the partition) that logical 1 is applied ^to one of pamuon appEed to the predicuon 

Se a5d gates 570,-570.. THe AND gate w^ produce -J^J*, ^ 

loeical 1 output only if the corresponding stage of the procc • -'^^jogj^o, atlhe lines 61S.Thus 

pSn sSgth store 571 stores a logical 1 c^espond- ^ U ^ f £^ logical 1 output and gates 630.631 are 
L to a "weak" prediction. If this occurs, then one of the 60 appearance of a predicted-taken result in the 

S;u«totheORgate574wmbeatIogicallandmeou^ ^^^ses me NOR gate 716 of thatpartition 
line 504 has a logical 1 indicating the PreJicUon «s weat ^ wde a ^ 0 output thus renderuignon^onducuve 
rnelogicallontheoneoftheinputnnes513,-5^isa^ top ^ ^ Hence the update line 502 
appUed to one input of a respecuve one of the AND gates ^ ^ ^ 401 _^ 03 and disconnected from 

57 &ofmeANDgate572,-572„ r .^^^^ $^1° ^ 
0 unless the utxlate enable line 502 is at logical 1 which logical i q 
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The prediction strength information, as mentioned above. 572. 573 etc to "strongly-taken". In this case, the iumo is 

ZT^lt^T v* 0 ™*™ ° D thC Ust0ly ° f id4Dtified ^ ThereisthusnofeLtoconecttheMe^So^ 

branch mstrucuons. which becomes the resolved a , ^ ^ 

Since the prediction of the outcome of a branch-i.e. cycle which includes fetching and executing the instruction! 

whether ornotajumpinstnicaonistaken-isrequiredatthe 5 At the start of cycle C 2 . the prediction is 

time of fetching, rather than executing, the branch and the jump is resolved as "taken". The interim update 

instruction, it is necessary to update the prediction for use prediction remains strongly taken and. as the iurnTu 

die next ttme that particular jump instruction is fetched. resolved a? being taken, the resolved prediction is likewis- 

depending on whether or not the prediction currently being "strongly taken" ""aisusewL. 

made is found to be correct or not during actual execution, to However, if in cycle C 3 the prediction is incorrect in that 

The strategy adopted » the Fcsendy-described embodiment the jump is resolved as being not taken the* SXSX 

. is to update automatically to an interim new prediction on applies: . "mowing 

S^t^^°«^ C ^T t ^ ti0n L i3 ' infaCt - COlreCt At a start of cvde C s fte solved prediction was 

^f'^^»" f ^tohave been correct when "strongly taken" and the interim update isttus "strong 

the branch is resolved, then the interim prediction is retained is taken". However, as the jump is resolved as "not-taken" me 

as the new prediction for the next execution, referred to prediction requires updating to "weakly taken" as shown! 

herein as the resolved prediction; only if the present predic- Thus it will be seen that the strength not the prediction is 

tion is found to have been incorrect is there a need for changed. 

^ e ^° n . an , d a ^ted 'resolved prediction' is stored. For the next cycle. Q. the prediction value is still "taken" 

The branch target buffer defaults to a state in which no 2b although "weakly-taken". Thus the interim update will be 

branch is predicted taken for any cycle in which corrections from "weakly-taken" to "strongly-taken", on the astuSoa 

are being undertaken. This enables the branch target buffer that the predicted behaviour is correct. If howeverYnce 

to be implemented as a single-port device, the single port again the jump is resolved as being not-taken the resolved 

being alternatively used for reading out of predictions and prediction ruuk be corrected to "weldy-notSS" m cSS 

"SS.fteSRT-,* , , • ^ 25 wor ^ ^ co ^<>° between the resolved prediction of 

Referring to FIGS. 6 and 7. the process of updating the cycle C, to the resolved prediction of cycle C. requires th- 

S^^^K!^ dKm " bed - B0th P rediction t0 be ^ rather than the strength to be 

wti&rfrand^howlhetistorro^ - changed i.e. fr om "weakly-taken " t o "weaklv-not-tak en w ' 

^ mtoA( ; branch ^buffer during an entry cy-cle . This change is made by an associative look-up in the present 

(E). For this explanation, it is assumed that each time the 30 embodiment. ^ 

branch' instruction is fetched, the previous execution of the Finally at the start of cfcle C s the prediction value is 

mstructionhas been resolved In practice, it may bepossible "weakly not taken", and accordingly the ''interim update" is 

for the instruction to be fetched again before a previous to "strongly not taken". Ifthejump is resolved as "nottaken" 

execution has been corseted, as will be later described the resolved prediction is "strongly not taken" whereas if the 

herein. On entry, the premction and prediction-strength for 35 jump is resolved as taken, then the resolved prediction 

any newly-entered jump insrructionis "weakly-taken" (wT). would be "weakly taken" 

The prediction and strength are stored as a logical 1 Turning to FIG. 7. the progress of a jump instruction is 

Oadicaong ndwi") in the prediction store 563. 663. 763. shown, in which the first Solution of thejGmp instruction 

863 and as logical 1 (indicating "weak") in the prediction after entry into the branch target buffer is incorrect Thus in 

strength store 591 etc. to form the prediction for that « cycle C t the interim update, assuming that the prediction 

m V£*nr?? u « wut f « ext value of "weakly taken* is correct, is to "strongly-taken". 

in both FIGS. 6 and 7. cycle C, represents the next cycle However, the jump is resolved as "not-taken". As a result 

mwmch the presently-considered instruction is fetched, and the resolved prediction is 'Veakly-not-taken" in other 

the resolved prediction for the previous cycle represents the words requiring the prediction at the start of the cycle which 

prediction state at the time of next fetching of the « was "weakly taken" 'to be corrected to 'Veakly noftS" 

™^?cV e - 6 , Start ° f & ' *de C »- Similarl y- C * At fte start ">eakly not taken" prediction 

represents the second occasion at which the branch instruc- gives an interim update of "strongly not taken" and the 

* . t *f reSOlved P^^ 011 for c y cle C, resolution of the jump as "not taken" confirms this value as 
represents the predicted value for cycle C,. and so on. In the resolved prediction. However, in cycle C, although th- 
eacfc cycle, the prediction is updated to provide an interim 50 interim update is to remain at "strongly not taken" tfa- 
update by assuming that the prediction at the start of the . resolution is "taken" thus the resolved prediction at the end 
cycle was correct If execution of the instruction shows that of cycle C, is "weakly not taken". Le. once a°ain chaii<an« 
toe prediction was in fact wrong, the interim update is only the strength of the prediction, rather than the predion 
discarded, and a new prediction made. This new prediction itself. This should be contrasted with cycle C4. in which the 
is based upon the previous prediction, at the start of die 55 predicted value of "weakly not taken", at the start of tri- 
cycle and the knowledge gained during resolution. cycle is updated to "strongly not taken", but as a result of a 
in the presently-described embodiment, a branch instruc- resolution to "taken" is finally corrected to "weakly taken" 
uon is put in the BTB only when it is executed to cause a in other words a change of prediction value, not of predic- 
jumptromthe normal sequence. Thus at the end of the entry tion strength. ~ 

3r rr*? " reso, " tion * *• histoi y of ^ i um P instruction is so The above discussion of the branch target buffer generally 

Rrf^rfn 1 !,? 0 ^ < * rehteS ,0 4(5 circuitt y ™ °P« a u°n of the buffer in the 

Kerernng to FIG. 6 the progress of a jump instruction reading mode. Details of the circuitry for writing informa- 

whichis initially correctly-predicted will be described: tion to an exemplary Ike of the branch target buffer will now" 

At the start of cycle Q. the prediction is "weakly taken" be described with reference to FIG. 8. FIG. 8 shows an 

~m •! * a jl Umed ° y * e branch target buffet mat J' um P 65 extm P Iar y content addressable memory cell 510.. being one 

wfll m fact be taken. Thus the prediction state is updated by of the memory cells 510 in the first partition 401 This 

the update enable line 502 and the automatic update circuitry content addressable memory cell 510. for the ouroose of 
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The output line 511, passes to the previously-described J^S" 11 " T ° ^ ^ responds 

selection circuitry 512 to an output line 513.. In the writins 1 !" entI ?. C0 ^ Dd frora the P"** 55 ^ HO to cause the 

mode, input 912 changes state from the reading mode so that ^ l °f a Ut Lne 907 10 a h * €al 1 input 

the selection circuitry 512 provides a connection between its 7? e ^ 9 t0 sup ? ly a lo S ical L *e write enable line 

input and output where the partition is to be written to. but 10 b goes *? io S ical tne of me second OR gate 906, 

for all other partitions, the circuit 512 provides no connec- goes t0 kfcfc* 1 enabling the traiismission gates 918, 

tion between input and output and those outputs are held to and 91 ?* and causing the logical 1 on line 907 to be written 

ground. The content addressable memory cells are not only t0 prediction store register stage 563* (representing "taken") 

connected to instruction word address input bus 500. but and the logical 1 on second input line 909 to be written to 

also to instruction word write bus 900. This bus is connected 15 we prediction strength stage 571, (representing "weak"), 

to each of the storage locations of the content addressable The address data presented on the write bus 900 is written 

memory. Each of the storage locations of the content addres- to the CAM cell 510,, as has previously been mentioned A 

sable memory 510, is also connected to a write enable line logical 1 applied to line 920 ensures that OR gate 590 

901* which receives a logical 1 when the address currently outputs a logical 1, thus rendering conductive the transmit 

appearing on the write bus 900 is to be written to the content 20 sion gates 530. The presence of the logical 1 on the write 

addressable memory ceU 510,. A separate write enable line enable line 901, provided at the input to the first OR gate 

u provided for each line of each partition. A further control 905, however provides a logical 1 on the word line 515, to ' 

line 920 connected to all partition lines is provided to allow the RAM line 518, and this allows target address data to be 

a new target address to be written to the RAM 518. written into the corresponding line 518, of the RAM 518 

The CAM output line 511, is connected to a pull down 25 from a target address input signal 90S 
transistor 902,. operable to pull the CAM output line to To ensure that there is always an empty line available for 
7? e tna ^ M m * is CODtroIled by a each partition, every time a new branch is entered into the 
-<^fi*f^ateb-903^hiefi^ --bra nch uu ge t bu ff er a rarrdoTinmmSeT generator 113 ran 
^P ut ^W, tomarktheline valid or invalid. The write enable domly selects a line in the partition which is to contain the 
ha! 6 1S ******* t0 0QC input of a two-input OR 30 new branch and inputs^ bit to the corresponding latch 903 
gate 905,, whose other input is provided by the output of the so that the line is marked invalid. The line to which the 
AND gate 564, of the prediction processing circuitry 514 branch instruction is written is selected using a stored value 
and the output of the OR gate 905, forms the word line 515, of the previously-selected random number. This technique 
for row b of the RAM 518. The write enable line 901, is requires afirstrow decoder 115 to output signal 904 to mark 
further connected to one input of a two-input OR gate 906,. 35 a line as invalid and a second row decoder 114 to output 
Register entry 563, of the prediction store 563 is connected signal 901 to add the new branch. The random number 
to a prediction change enable line 907 allowing a new generator 113 is configured to ensure that the same number 
prediction to be written in from the circuit 102 via a is not generated twice successively 
fransmission gate 918, The output of the prediction store FIG. 9 shows schematically a computer system includin* 
563, is coupled via an OR gate 565, to the AND gate 564,. 40 the branch target buffer 400 already described. A processor 
Thesecondmputof the OR gate 56S, is provided by the 110 is arranged to fetch instructions from a memory 111 and' 
control line 920. Prediction strength storage location 571, is execute aplurality of instructions in a pipelined process The 
connected to an input line 909 allowing a new prediction fetch address is output on line 120 and this full instruction 
strength to be written in from state transition circuitry 112. address is supplied to the memory HI and. at the same time 
via a second transmission gate 910,. The output of the 45 through a multiplexor 116 to the inputs 500 and 501 of the 
second OR gate 906, provides the control input to the buffer 400. The fetched instructions are supplied on line 121 
transmission gates 918, and 910,. The second input to the to the processor for execution and me results of a read 
secondOR gate 906, is provided by the output of an AND operation in the buffer 400 are fed on Knes 504 505 and 507 
gate 9l\ b which receives a first input from the corresponding to the processor 110 as previously described. At an appro- 
output of the selection circuitry 512 and a second input from 50 priate stage in the processing pipeline, the processor 110 will 
an input line 912. resolve each branch instruction to determine whether it is in 

As previously mentioned, the write enable line 901, is fact taken. If the processor had received from buffer 400 a 

operative to allow an address input over write bus 900 to be correct prediction for the branch instruction then no write 

stored in the CAM ceil 510,. At all times, one line in each operation is required. If however no correct prediction had 

partition comprising a CAM cell, a line of the target address 55 been received (due to either absence of the branch instruc- 

RAM and the associated prediction and prediction strength tion in the buffer, or a misprediction by the buffer) then the 

data is identified as invalid, and it is mis invalid line to which processor outputs signals to the circuit 112 to cause a write 

a new branch jump instruction is written. To render the The output on lines 122 comprises the result of resolution 

partition line shown in HG. 8 invalid, an input is applied to the prediction , for that branch instruction, the appropriate 

control input 904, so as to set the latch 903* and render 60 target address and branch address for me instruction These 

conductive the pull down transistor 902,,. Trie effect of this are input to the state transition dicuivy 112 which provides 

is to pull the CAM output line 511, down to earth. The CAM outputs 900. 907. 908. 909, 912. 92* and 502 to the branch 

output line is then held at earth until a new predicted value target buffer 400 and an output on line 123 to operate the 

is wntten into this line and the line is marked valid, as later random number generator 113. Hie state transition circuitry 

aesoibed. ^ ^ rcsp onsive to the resolution and prediction received on 

pj. 0 ?^ te . a Dew Preteted-tofcn entry to the line shown in line 122 to change prediction or strength of prediction values 

HU. 8, it is necessary to write information to the content in accordance with the resolution of the branch instruction 
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by the processor 110 as has previously been described with line 912 permits the writing of a new prediction, and 

reference to FIGS 6 and 7. The updated prediction and prediction strength bit into the corresponding line of the 

strength of predict^ values are input on lines 907 and 909 prediction store 563 and the prediction strength store 571. 

to till appropriate entry in the buffer 400. together with the For re-writing of the : value of predicUon. or prediction 

io uic apprupuai* cimj 1U ** A a~** strength, the writs enable line 901. is kept at a logical 0. thus . 

other data to be written including the target address on Une 5 - ^ ^ rf a ^ ^ ,J q ^ ^ ^ ^ 

908. as already described with reference to FIG. 8. to tnis £ new addr£S5 > mtQ ^ 518 and ^ ^4 

case the output bus 900 outputs the full instruction address iflput ^ 9U i$ takcn t0 logical L p resC nce of a logical 

of the instruction for the write operation into the buffer and ^ (corresponding to the match between the input instruction 

line 900 forms a second input to the multiplexor 116. Line address on bus 500 and the information stored in the CAM 

912 is set to logic 1 corresponding to a write operation and 10 cell 510 6 ) provides a logical 1 to the second input of the 

the multip!exorll6 selects the address from line 900 to input AND gate 911^ and a consequent logical 1 on the output of 

on lines 500 and 501 to cany out an associate operation on the second OR gate 906 fe . This causes the transmission gates - 

the CAM of the partition denoted by 501 in parallel with the 9lS b and 910^ to be rendered conductive, which allows the 

write operation caused by an input on line 900 directly into new prediction value to be input over line 907, and the new 

the buffer 400. The input 900 which is received directly by 15 prediction strength value to be input over Une 909 

the buffer 400 inputs only the word address bits of the In the emboaiment described with respect to FIG. £ no 

uic tnxuti jj y . c . . OA\jf o^*,rc prediction is performed in cycles where a write is being 

from the address line 120 a second ^^ 0 ^J!^ It will be appreciated that the above example provides 
and a control input 126 from line 912. Li each write 20 circuit ^ for ™ cntijQ g a double entry m the buffer for the 
operation the branch target buffer receives an input from the braflch ^struction. It will be understood that due to the - 

state transition circuitry 112 attempting to correct the con- p roce ssor pipeline, there will be delay between a new branch 
tent of the buffer without distinguishing whether the error is instruction first being fetched by the processor and found 
due to a misprediction or to the absence of the branch absent from the buffer and the final resolution of the branch 
instruction in the buffer. Accordingly, the embodiment per- 25 instruction by the processor which would then result in a 
forms me same remedial action in both situations as follows: output through the state transition circuitry 112 to cause a 

1. Input the instruction address to the branch target buffer write into the buffer. Although it is unlikely, it is possible 

for an assnriafiyr F>atrh At th* timc-begia -to that-the^ame^bmchnns u uOiou could be .fetched a second " 

write to the invalid line in the branch target buffer. time by the process or before the first execution of the branch 
information as if the error had occurred due to absence 30 instruction has been resolved and caused a write entry into 
of the branch instruction in the buffer. If the prediction the buffer. It is therefore important to avoid the possibility of 
on line 907 is taken, then line 920 is set to logic 1 to writing the same branch instruction into the buffer at a 
allow the new target address to be written. In this case . different location when the mstrucUon was not located in toe 
ri^MO to^i^dcs 1 10^ 1 input to AND buffer the first time it was fetched by the pressor. It is for 
signal y-w uculo j. piuviu« a 5 f this re ason that an associative match on the branch address 

gate 564,. If a CAM hit occurs iforthat partition row 35 ^^^^^soo^Ol in paraUel with the write input 
signalS13 d is set to logical 1 and consequent OR gate ^ ^ m Qf ^ mult £ lexor u6 enJUres ftat ^ 

905* provides a logical 1 on wordhne 515* ot the address 0Q bus 900 ^ used simultaneously for both 

RAM. In addition, line 920 causes circuit 506 to put the operations and while this parallel operation is carried out the 
new target address from input 908 onto the RAM bit ^ written to is held invalid. If that associate opera- 
lines 503 so that the new target address is written into 40 tion a bit then the new entry line is maintained as 
that row of the RAM. invalid and a line on which the hit occurred has a modifi- 

2. If the associative match creates a hit. this indicates that cation of the prediction strength and target which have been 
the branch had already been located in by the branch input from the state transition circuitry 102. In the event that 
target buffer. As a result, the (supposedly new) branch no hit was found by the associate operation the new entry 
entry is terminated by leaving the relevant line as 45 line is marked as valid and a further new line is invalidated 
^vaiid. ready for a next write operation. 

If the associative match resulted in no bit then the cause In FIG. 3. both the content addressable memory cells 510. 

of the etror was a genuinely new branch, and the line to 610. 710 and 810 and the date RAMs 518 618. 718. 818 

which information is being written is marked as valid. In this store full addresses. In the case of the CAM ceU. by full 

case none of lines 513 is high so that an output 923 of an OR so address" is intended the full fetch address of each instruction 

»ate 922 is low. Line 923 forms an input to latch 903* so as word. For the data RAM. the "full address means the 

to clear the latch for the new line selected by the high signal precise target address, or the offset from the address of the 

on line 901„ thereby marking that line as valid. Signal 903 branch instruction to be executed, 

causes latch903* to respond to signal on line 904 6 to mark In one embodiment .however _both the CAM cells and the 

a new line as invalid 55 data RAM store less than full addresses. Specifically, before 

If a wron° prediction has been made, then the automatic a branch instruction is entered into the branch target buffer 
update properties of the described branch target buffer may the most significant bits of the fetch address of the branch 
result in a wrong value of interim update of prediction being instruction are compared with the most significant bits of the 
stored. Reference should be made to FIGS. 6 and 7 and the target address of that instruction. This is done by a corn- 
corresponding description. It will be seen from this that if parator circuit in circuitry 400 of FIG. 9 comparing 
the original prediction were "weakly taken" or * Nveakly not 60 addresses on output hues 900 and 908 from the circuit 112. 
taken" Mduie resolution of the branch was in the opposite If the contents of a selected number of the most significant 
direction. Le. "not taken'' or "taken", then a new value of bits of the fetch address differ from the contents of the same 
prediction must be written to the prediction store 563. If on number of selected bits of the target address, no entry is 
uie other hand the originally-stored prediction was "strongly made into the branch target buffer. If however the contents 
taken" or "strongly not taken" then if the branch were 65 of the selected number of most significant bits are the same 
resolved in the opposite direction, the strength bit must be in both the fetch address and the target addressee, the 
re-writtea. Accordingly, referring to' FIG. 8 the third input result of taking the branch instrucuon would be to branch to 
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a new fetch address which is in the same or an adjacent 
program segment to the fetch address of the instruction 
itself, then the lesser significant bits of the target address of 
that instruction are stored in the branch target buffer. Simi- 
larly only the same number of lesser significant bits of the 5 
branch instruction address are written from line 900 into the 
buffer 400. 

In this arrangement, during instruction fetching, the full 
fetch address is applied to the branch target buffer, although 
only the relevant number of lesser significant bits of the 
fetch address are input over the instruction word address bus 10 
500. It will be apparent to one skilled in me art that in this 
embodiment instructions from other program segments. i.e. 
instructions having most significant bits which differ from 
the most significant bit of the branch instruction which is 
stpred may have identical lesser significant bits to those of 15 
the instructions stored The consequence of this is that 
instructions may be falsely identified as being branch 
instructions, with consequent processing delays. 

We claim: 



